PlantTFDB
Plant Transcription Factor Database
v4.0
Previous version: v3.0
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Thecc1EG014612t1
Common NameTCM_014612
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma
Family Trihelix
Protein Properties Length: 1005aa    MW: 112016 Da    PI: 9.6815
Description Trihelix family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Thecc1EG014612t1genomeCGDView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1trihelix65.31.3e-20912992186
          trihelix   1 rWtkqevlaLiearremeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcpyfdqle 86 
                       +W+ +ev++Li++r ++++r++  k +++lWee+s  ++++g++rsp qCk+ w +l ++y++ k  +k++     +++pyf+++ 
  Thecc1EG014612t1 912 KWKPEEVKKLIKMRGKLHSRFQVVKGRMALWEEISTSLMAEGISRSPGQCKSLWTSLVQKYEESKGEKKSH-----KEWPYFEDMS 992
                       7*************************************************************999999986.....68******96 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
Gene3DG3DSA:3.60.15.101.2E-67195445IPR001279Metallo-beta-lactamase
SuperFamilySSF562816.89E-71196610IPR001279Metallo-beta-lactamase
SMARTSM008494.6E-26208405IPR001279Metallo-beta-lactamase
PfamPF127067.9E-11220355IPR001279Metallo-beta-lactamase
PfamPF075211.2E-5551582IPR011108Zn-dependent metallo-hydrolase, RNA specificity domain
PROSITE profilePS500908.014905969IPR017877Myb-like domain
Gene3DG3DSA:1.10.10.602.9E-5908971IPR009057Homeodomain-like
PfamPF138375.7E-17910993No hitNo description
CDDcd122033.37E-21911976No hitNo description
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0009658Biological Processchloroplast organization
GO:0009942Biological Processlongitudinal axis specification
GO:0060918Biological Processauxin transport
GO:0009507Cellular Componentchloroplast
GO:0016021Cellular Componentintegral component of membrane
GO:0003677Molecular FunctionDNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 1005 aa     Download sequence    Send to blast
MKKVKKKRRK NLYTCGKNTR GRNLSCFKND KQDISPPYKN LHTFTHSLNF PLPEAQSSPL  60
LLVSKLLHKN AKCSGFNLHC CFFFFVNLKE QMQLGFLGGL SFSYSLYFTS FKPIKAPTKM  120
AASTAHSLCP YGLYCRPNPR HRYISCSVGS PTPLGTRRTK VPRKKSGRLD GARKSMEDSV  180
QRKMEQFYEG TAGPPLRVLP IGGLGEIGMN CMLVGNYDRY ILIDAGVMFP DYDELGVQKI  240
IPDTTFIKKW SHKIEAVVIT HGHEDHIGAL PWVIPALDSH TPIYASSFTM ELIKKRLKEN  300
GIFVPSRLKI FKTRKRFMAG PFEIEPLRVT HSIPDCCGLV LRCADGTILH TGDWKIDESP  360
LDGKIFDRQF LEDLSKEGVT LMMSDSTNVL SPGRTISESS VADALLRHIS AAKGRIITTQ  420
FASNIHRLGS VKAAADLTGR KLVFVGMSLR TYLDAAWKDG KAPIDPSTLV KVEDIDAYAP  480
KDLIIVTTGS QAEPRAALNL ASYGSSHSFK LNKEDVILYS AKVIPGNESR VMKMLNRISE  540
IGSTIVMGKN EGLHTSGHGY RGELEEVLKI VKPQHFLPIH GELLFLKEHE LLGKSTGIRH  600
TTVIKNGEML GVSHLRNRRV LSNGFSSLGK ENLQLMYSDG DKAYGTSTEL CIDERLRIAS  660
DGIIVVSMEI LRPQKIDGIM ENSLKGKIRI TTRCLWLDKG KLLDALHKAA HAALSSCPVN  720
CPLGHMERTV SEVLRKMVRK YSGKRPEVIA IALENPAGVF SDELNERLSG NYNVGFEIPT  780
LRKVVDGHPK RSQPNKIKAE DDSNLHLENT SEQSLEVSDG EVEKLLPEED TTTSSPDSLE  840
RHTPNSEGSD EFWKSFITSS SPVNNLVNDN NGLVPKKEYK SQLKSDGTAS SGDDSEMPSS  900
QPKSSKPAKR NKWKPEEVKK LIKMRGKLHS RFQVVKGRMA LWEEISTSLM AEGISRSPGQ  960
CKSLWTSLVQ KYEESKGEKK SHKEWPYFED MSKVFSDFEA TATK*
3D Structure ? help Back to Top
Structure
PDB ID Evalue Query Start Query End Hit Start Hit End Description
5a0t_A1e-8419675419560RIBONUCLEASE J
5a0t_B1e-8419675419560RIBONUCLEASE J
5a0v_A1e-8419675419560RIBONUCLEASE J
5a0v_B1e-8419675419560RIBONUCLEASE J
Search in ModeBase
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
119KKVKKKRRK
248KKKRR
349KKKRRK
459KKRRK
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_007037950.10.0RNA-metabolising metallo-beta-lactamase family protein
TrEMBLA0A061FYV10.0A0A061FYV1_THECC; RNA-metabolising metallo-beta-lactamase family protein
STRINGVIT_17s0000g01640.t010.0(Vitis vinifera)
STRINGPOPTR_0012s09780.10.0(Populus trichocarpa)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM95392634
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT5G63420.10.0Trihelix family protein
Publications ? help Back to Top
  1. Motamayor JC, et al.
    The genome sequence of the most widely cultivated cacao type and its use to identify candidate genes regulating pod color.
    Genome Biol., 2013. 14(6): p. r53
    [PMID:23731509]